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SEQUENCE LISTING 



<110> De Maria, Leonardo 
Andersen, Carsten 

Christensen, Lars Lehmann Hylling 
Lassen, Soren Flensted 
Ostergaard, Peter Rahbek 

<120> Protease Variants 

<130> 10508 . 204-US 

<140> 10574554 

<141> 2008-08-18 

<150> DK PCT/DK2004/000688 

<151> 2004-10-08 

<150> DK PA 2003 01494 

<151> 2003-10-10 

<150> DK PA 2004 00333 

<151> 2004-03-01 

<150> US 60/549,347 

<151> 2004-03-02 

<150> US 60/510,450 

<151> 2003-10-10 

<160> 21 

<170> Patentln version 3.5 

<210> 1 

<211> 1596 

<212> DNA 

<213> Nocardiopsis sp. NRRL 18262 ("Protease 10") 

<220> 

<221> CDS 

<222> (318) . . (1463) 

<220> 

<221> sig_peptide 

<222> (318) . . (404) 

<220> 

<221> mat_peptide 

<222> (900) . . (1463) 

<400> 1 

acgtttggta cgggtaccgg tgtccgcatg tggccagaat gcccccttgc gacagggaac 60 



ggattcggtc ggtagcgcat cgactccgac aaccgcgagg tggccgttcg cgtcgccacg 



120 



ttctgcgacc gtcatgcgac ccatcatcgg gtgaccccac cgagctctga atggtccacc 



180 



gttctgacgg tctttccctc accaaaacgt gcacctatgg ttaggacgtt gtttaccgaa 240 

tgtctcggtg aacgacaggg gccggacggt attcggcccc gatcccccgt tgatcccccc 300 

aggagagtag ggacccc atg cga ccc tec ccc gtt gtc tec gee ate ggt 350 

Met Arg Pro Ser Pro Val Val Ser Ala lie Gly 

-190 -185 

acg gga gcg ctg gee ttc ggt ctg gcg ctg tec ggt ace ccg ggt 395 
Thr Gly Ala Leu Ala Phe Gly Leu Ala Leu Ser Gly Thr Pro Gly 

-180 -175 -170 

gee etc gcg gec acc gga gcg etc ccc cag tea ccc acc ccg gag 440 
Ala Leu Ala Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu 

-165 -160 -155 

gec gac gcg gtc tec atg cag gag gcg etc cag cgc gac etc gac 485 
Ala Asp Ala Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp 

-150 -145 -140 

ctg acc tec gec gag gee gag gag ctg ctg gee gee cag gac acc 530 
Leu Thr Ser Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr 

-135 -130 -125 

gec ttc gag gtc gac gag gec gcg gee gag gec gec ggg gac gec 575 
Ala Phe Glu Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala 

-120 -115 -110 

tac ggc ggc tec gtc ttc gac acc gag age ctg gaa ctg acc gtc ctg 623 
Tyr Gly Gly Ser Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu 

-105 -100 -95 

gtc acc gat gec gee gcg gtc gag gec gtg gag gec acc ggc gec ggg 671 
Val Thr Asp Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly 

-90 -85 -80 

acc gag ctg gtc tec tac ggc ate gac ggt etc gac gag ate gtc cag 719 
Thr Glu Leu Val Ser Tyr Gly lie Asp Gly Leu Asp Glu lie Val Gin 

-75 -70 -65 

gag etc aac gec gee gac gec gtt ccc ggt gtg gtc ggc tgg tac ccg 767 
Glu Leu Asn Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro 
-60 -55 -50 -45 

gac gtg gcg ggt gac acc gtc gtc ctg gag gtc ctg gag ggt tec gga 815 
Asp Val Ala Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly 

-40 -35 -30 

gee gac gtc age ggc ctg etc gcg gac gee ggc gtg gac gee teg gec 863 
Ala Asp Val Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala 

-25 -20 -15 

gtc gag gtg acc acg age gac cag ccc gag etc tac gec gac ate ate 911 
Val Glu Val Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie 
-10 -5 -1 1 



ggt ggt ctg gcc tac acc atg ggc ggc cgc tgt teg gtc ggc ttc gcg 
Gly Gly Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala 
5 10 15 20 



959 



gcc acc aac gcc gcc ggt cag ccc ggg ttc gtc acc gcc ggt cac tgc 1007 

Ala Thr Asn Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys 

25 30 35 

ggc cgc gtg ggc acc cag gtg acc ate ggc aac ggc agg ggc gtc ttc 1055 

Gly Arg Val Gly Thr Gin Val Thr lie Gly Asn Gly Arg Gly Val Phe 

40 45 50 

gag cag tec gtc ttc ccc ggc aac gac gcg gcc ttc gtc cgc ggt acg 1103 

Glu Gin Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr 
55 60 65 

tec aac ttc acg ctg acc aac ctg gtc age cgc tac aac acc ggc ggg 1151 

Ser Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly 

70 75 80 

tac gcc acg gtc gcc ggt cac aac cag gcc ccc ate ggc tec tec gtc 1199 

Tyr Ala Thr Val Ala Gly His Asn Gin Ala Pro lie Gly Ser Ser Val 

85 90 95 100 

tgc cgc tec ggc tec acc acc ggt tgg cac tgc ggc acc ate cag gcc 1247 

Cys Arg Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala 

105 110 115 

cgc ggc cag teg gtg age tac ccc gag ggc acc gtc acc aac atg acc 1295 

Arg Gly Gin Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr 

120 125 130 

egg acc acc gtg tgc gcc gag ccc ggc gac tec ggc ggc tec tac ate 1343 

Arg Thr Thr Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie 

135 140 145 

tec ggc acc cag gcc cag ggc gtg acc tec ggc ggc tec ggc aac tgc 1391 

Ser Gly Thr Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys 

150 155 160 

cgc acc ggc ggg acc acc ttc tac cag gag gtc acc ccc atg gtg aac 1439 

Arg Thr Gly Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn 

165 170 175 180 

tec tgg ggc gtc cgt etc egg acc tgatccccgc ggttccaggc ggaccgaegg 1493 

Ser Trp Gly Val Arg Leu Arg Thr 

185 

tcgtgacctg agtaccaggc gtccccgccg cttccagcgg cgtccgcacc ggggtgggac 1553 

cgggcgtggc cacggcccca cccgtgaccg gaccgcccgg eta 1596 



<210> 2 
<211> 382 
<212> PRT 



<213> Nocardiopsis sp. NRRL 18262 ("Protease 10") 



<400> 2 

Met Arg Pro Ser Pro Val Val Ser Ala lie Gly Thr Gly Ala Leu 

-190 -185 -180 



Ala Phe Gly Leu Ala Leu Ser Gly Thr Pro Gly Ala Leu Ala Ala 

-175 -170 -165 



Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 

-160 -155 -150 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala 

-145 -140 -135 



Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val 

-130 -125 -120 



Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser 

-115 -110 -105 



Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala 

-100 -95 -90 



Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu Val 

-85 -80 -75 



Ser Tyr Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu Asn Ala 
-70 -65 -60 



Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly 
-55 -50 -45 



Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser 
-40 -35 -30 -25 



Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr 

-20 -15 -10 



Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala 

-5 -11 5 



Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala 



10 15 20 



Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val Gly 
25 30 35 40 



Thr Gin Val Thr lie Gly Asn Gly Arg Gly Val Phe Glu Gin Ser Val 

45 50 55 



Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe Thr 

60 65 70 



Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr Val 
75 80 85 



Ala Gly His Asn Gin Ala Pro lie Gly Ser Ser Val Cys Arg Ser Gly 
90 95 100 



Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin Ser 
105 110 115 120 



Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr Val 

125 130 135 



Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr Gin 

140 145 150 



Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly Gly 
155 160 165 



Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly Val 
170 175 180 



Arg Leu Arg Thr 
185 



<210> 3 

<211> 1065 

<212> DNA 

<213> Nocardiops is dassonvillei subspecies dassonvillei DSM 43235 ("Protease 18") 



<220> 
<221> 
<222> 



CDS 

(1) . . (1062) 



<220> 
<221> 
<222> 



mat_pept ide 
(499) . . (1062) 



<400> 3 

get ccg gec ccc gtc ccc cag acc ccc gtc gec gac gac age gec 45 

Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala 
-165 -160 -155 

gec age atg acc gag gcg etc aag cgc gac etc gac etc acc teg 90 

Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 

gee gag gee gag gag ctt etc teg gcg cag gaa gee gee ate gag 135 

Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu 
-135 -130 -125 

acc gac gec gag gee acc gag gec gcg ggc gag gec tac ggc ggc 180 

Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 

tea ctg ttc gac acc gag acc etc gaa etc acc gtg ctg gtc acc gac 228 

Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 

gec tec gee gtc gag gcg gtc gag gec acc gga gec cag gee acc gtc 27 6 

Ala Ser Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 



gtc tec cac ggc acc gag ggc ctg acc gag gtc gtg gag gac etc aac 324 

Val Ser His Gly Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 

-70 -65 -60 

ggc gec gag gtt ccc gag age gtc etc ggc tgg tac ccg gac gtg gag 372 

Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu 

-55 -50 -45 

age gac acc gtc gtg gtc gag gtg ctg gag ggc tec gac gec gac gtc 420 

Ser Asp Thr Val Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val 

-40 -35 -30 

gee gee ctg etc gec gac gec ggt gtg gac tec tec teg gtc egg gtg 468 

Ala Ala Leu Leu Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val 

-25 -20 -15 

gag gag gee gag gag gee ccg cag gtc tac gee gac ate ate ggc ggc 516 

Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp lie lie Gly Gly 

-10 -5 -11 5 

ctg gee tac tac atg ggc ggc cgc tgc tec gtc ggc ttc gec gcg acc 564 

Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 

aac age gec ggt cag ccc ggt ttc gtc acc gec ggc cac tgc ggc acc 612 

Asn Ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 

25 30 35 



gtc ggc acc ggc gtg acc ate ggc aac ggc acc ggc acc ttc cag aac 660 

Val Gly Thr Gly Val Thr lie Gly Asn Gly Thr Gly Thr Phe Gin Asn 

40 45 50 

teg gtc ttc ccc ggc aac gac gec gec ttc gtc cgc ggc acc tec aac 708 

Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 

55 60 65 70 

ttc acc ctg acc aac ctg gtc teg cgc tac aac tec ggc ggc tac cag 756 

Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Gin 

75 80 85 

teg gtg acc ggt acc age cag gec ccg gec ggc teg gee gtg tgc cgc 804 

Ser Val Thr Gly Thr Ser Gin Ala Pro Ala Gly Ser Ala Val Cys Arg 

90 95 100 

tec ggc tec acc acc ggc tgg cac tgc ggc acc ate cag gec cgc aac 852 

Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Asn 

105 110 115 

cag acc gtg cgc tac ccg cag ggc acc gtc tac teg etc acc cgc acc 900 

Gin Thr Val Arg Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr 

120 125 130 

aac gtg tgc gec gag ccc ggc gac tec ggc ggt teg ttc ate tec ggc 948 

Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe lie Ser Gly 

135 140 145 150 

teg cag gee cag ggc gtc acc tec ggc ggc tec ggc aac tgc tec gtc 996 

Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Val 

155 160 165 

ggc ggc acg acc tac tac cag gag gtc acc ccg atg ate aac tec tgg 1044 

Gly Gly Thr Thr Tyr Tyr Gin Glu Val Thr Pro Met lie Asn Ser Trp 

170 175 180 

ggt gtc agg ate egg acc taa 1065 

Gly Val Arg lie Arg Thr 
185 



<210> 4 
<211> 354 
<212> PRT 

<213> Nocardiops is dassonvillei subspecies dassonvillei DSM 43235 ("Protease 18") 
<400> 4 

Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala 
-165 -160 -155 



Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 



Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu 



-135 



-130 



-125 



Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 



Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 



Ala Ser Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 



Val Ser His Gly Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 

-70 -65 -60 



Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu 

-55 -50 -45 



Ser Asp Thr Val Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val 
-40 -35 -30 



Ala Ala Leu Leu Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val 
-25 -20 -15 



Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp lie lie Gly Gly 
-10 -5 -11 5 



Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 



Asn Ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 
25 30 35 



Val Gly Thr Gly Val Thr lie Gly Asn Gly Thr Gly Thr Phe Gl 



